A State Aggregation Approach to Singularly Perturbed Markov Reward Processes

Authors

  • Dali Zhang
  • Baoqun Yin
  • Hongsheng Xi
Abstract

In this paper, we propose a single-sample-path-based algorithm with state aggregation to optimize the average reward of singularly perturbed Markov reward processes (SPMRPs) with large-scale state spaces. The reward process is assumed to depend on a set of parameters. Unlike other kinds of Markov chains, SPMRPs have a hierarchical structure of their own. By exploiting this structure, our algorithm reduces the computational load of performance optimization. Moreover, because it evolves along the simulated sample path, the method can be applied online. In contrast to the original algorithm for general MRPs, we introduce a new gradient formula for the average-reward performance metric in SPMRPs, which is proved in the Appendix. Based on these gradients, we present the steps of an iterative algorithm that uses a single sample path. We then analyze a special case in which the parameters affect only the disturbance matrices, and give a precise comparison between our algorithm and earlier ones designed for general Markov reward processes. When applied to SPMRPs, our method is faster in these cases. Furthermore, to illustrate the practical value of SPMRPs, a simple example of multiprogramming in computer systems is presented and simulated. The physical meaning of SPMRPs in networks of queues is clarified with reference to practical models.

Keywords: Singularly perturbed Markov processes, Gradient of average reward, Differential reward, State aggregation, Perturbed closed network.
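The hierarchical structure mentioned in the abstract can be illustrated with a small numerical sketch. This is not the paper's sample-path gradient algorithm; it is only the textbook aggregation idea, under the assumption that the transition matrix has the form `P0 + eps * Q`, where `P0` is block diagonal and `Q` is a disturbance matrix with zero row sums. The matrices, the reward vector, and the function names `stationary` and `aggregated_stationary` are all hypothetical and chosen for illustration.

```python
def stationary(P, iters=20000):
    """Approximate the stationary row vector of a row-stochastic matrix
    by power iteration (assumes the chain is ergodic and aperiodic)."""
    n = len(P)
    pi = [1.0 / n] * n
    for _ in range(iters):
        pi = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
    total = sum(pi)
    return [x / total for x in pi]


def aggregated_stationary(P0, Q, blocks):
    """Limiting (eps -> 0) stationary distribution via state aggregation."""
    # Step 1: stationary distribution inside each block of the unperturbed chain.
    local = []
    for b in blocks:
        sub = [[P0[i][j] for j in b] for i in b]
        local.append(stationary(sub, 500))
    # Step 2: aggregated generator, D[I][J] = sum_i local_I[i] * sum_{j in J} Q[i][j].
    m = len(blocks)
    D = [[sum(local[I][a] * sum(Q[i][j] for j in blocks[J])
              for a, i in enumerate(blocks[I]))
          for J in range(m)] for I in range(m)]
    # Step 3: convert the generator into a stochastic matrix with the same
    # stationary vector, then solve the small aggregated chain.
    c = 0.5 / max(abs(D[I][I]) for I in range(m))
    A = [[(1.0 if I == J else 0.0) + c * D[I][J] for J in range(m)]
         for I in range(m)]
    theta = stationary(A, 2000)
    # Step 4: disaggregate: block weight times within-block distribution.
    pi = [0.0] * len(P0)
    for I, b in enumerate(blocks):
        for a, i in enumerate(b):
            pi[i] = theta[I] * local[I][a]
    return pi


# Hypothetical 4-state example with two weakly coupled blocks {0, 1} and {2, 3}.
P0 = [[0.5, 0.5, 0.0, 0.0],
      [0.2, 0.8, 0.0, 0.0],
      [0.0, 0.0, 0.7, 0.3],
      [0.0, 0.0, 0.4, 0.6]]
Q = [[-1.0, 0.0, 1.0, 0.0],
     [0.0, -1.0, 0.0, 1.0],
     [2.0, 0.0, -2.0, 0.0],
     [0.0, 1.0, 0.0, -1.0]]
blocks = [[0, 1], [2, 3]]
rewards = [1.0, 0.0, 3.0, 2.0]
eps = 0.001

P_eps = [[P0[i][j] + eps * Q[i][j] for j in range(4)] for i in range(4)]
pi_full = stationary(P_eps)                     # works on the full chain
pi_agg = aggregated_stationary(P0, Q, blocks)   # works block by block

eta_full = sum(p * r for p, r in zip(pi_full, rewards))
eta_agg = sum(p * r for p, r in zip(pi_agg, rewards))
print("average reward, full chain:      ", eta_full)
print("average reward, aggregated limit:", eta_agg)
```

The aggregated route only ever handles one block or the small block-level chain at a time, which is the kind of saving a hierarchical structure makes possible for large state spaces. For small `eps` the two average rewards should agree up to a difference of order `eps`.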

Similar resources

Ergodic Control of a Singularly Perturbed Markov Process in Discrete Time with General State and Compact Action Spaces

Ergodic control of singularly perturbed Markov chains with general state and compact action spaces is considered. A new method is given for characterization of the limit of invariant measures, for perturbed chains, when the perturbation parameter goes to zero. It is also demonstrated that the limit control principle is satisfied under natural ergodicity assumptions about controlled Markov chain...

Asymptotic linear programming and policy improvement for singularly perturbed Markov decision processes

In this paper we consider a singularly perturbed Markov decision process with finitely many states and actions and the limiting expected average reward criterion. We make no assumptions about the underlying ergodic structure. We present algorithms for the computation of a uniformly optimal deterministic control, that is, a control which is optimal for all values of the perturbation parameter tha...

Singularly Perturbed Finite Markov Chains with General Ergodic Structure

We analyse singularly perturbed Markov chains. Most previous research has been done under the assumption that the perturbed Markov chain is either ergodic or unichain. In this paper we do not impose any restrictions on the ergodic structure of the perturbed chain. The present approach is based on the inversion of analytic matrix-valued functions.

Numerical method for singularly perturbed fourth order ordinary differential equations of convection-diffusion type

In this paper, we have proposed a numerical method for singularly perturbed fourth order ordinary differential equations of convection-diffusion type. The numerical method combines boundary value technique, asymptotic expansion approximation, shooting method and finite difference method. In order to get a numerical solution for the derivative of the solution, the given interval is divided in...

A method based on the meshless approach for singularly perturbed differential-difference equations with Boundary layers

In this paper, an effective procedure based on coordinate stretching and radial basis functions (RBFs) collocation method is applied to solve singularly perturbed differential-difference equations with layer behavior. It is well known that if the boundary layer is very small, for good resolution of the numerical solution at least one of the collocation points must lie in the boundary layer. In ...

Publication date: 2006